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1. 


Introduction. 


By  a graph,  we  shall  mean  what  is  sometimes  called  a Michigan  graph: 
one  that  is  finite,  undirected,  without  loops  and  multiple  edges.  A set 
S of  vertices  in  a graph  G is  called  independent  or  stable  if  no  two 
vertices  in  S Eire  adjacent;  the  largest  cardinality  a(G)  of  a stable 
set  in  G is  called  the  stability  number  of  G . Now,  let  G be  a graph 
and  let  t be  a positive  integer  such  that 

a(G)  < t ; (1.1) 

how  laborious  is  it  to  verify  a proof  of  (l.l)V  Of  course,  this  question 
has  a direct  bearing  on  the  conjecture  that  P ^ NP  ; in  particular,  the 
celebrated  theorem  of  Cook  [2]  suggests  that  it  is  extremely  time-consuming 
to  verify  proofs  of  (1.1).  We  shall  refrain  from  elaborating  on  this 
interesting  point;  instead,  we  direct  the  reader  to  [2],  [14]  and  [1], 

As  for  evaluating  a(G)  , the  best  available  algorithm  is  due  to  Tarjan 
and  Trojanowski  [10]:  its  running  time  on  a graph  of  order  n is  0(2n/^)  . 

The  framework  of  the  present  paper  is  quite  modest:  restricting  the 

intuitive  notion  of  a proof  rather  drastically,  we  shall  study  the  resulting 
system  of  "recursive  proofs".  This  system  remains  powerful  enough  to 

(i)  encompass  a certain  class  of  algorithms  that  includes  the  Tarjan- 
Trojanowski  algorithm, 

(ii)  provide  very  short  proofs  of  (1.1)  for  every  graph  G whose  set  of 
vertices  can  be  covered  by  a(G)  cliques. 

Nevertheless,  we  shall  show  that  there  are  valid  inequalities  (1.1)  whose 
proofs  must  be  excessively  long.  More  explicitly,  for  every  sufficiently 
large  d there  is  a positive  e with  the  following  property:  for  an 

overwhelming  majority  of  all  graphs  G with  n vertices  and  dn  edges 
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there  are  valid  inequalities  (l.l)  whose  recursive  proofs  must  have 
length  at  least  (l+e)n  . (The  assumption  that  the  number  of  edges  of 
G grows  linearly  with  n is  crucial:  in  fact,  the  conclusion  fails  as 

soon  as  d is  allowed  to  grow  beyond  every  bound.  For  details,  see 
Proposition  4.1.) 

At  this  moment,  it  may  be  worth  pointing  out  two  shortcomings  that 
practitioners  sometimes  find  in  results  on  computational  complexity: 
the  worst  case  criterion  and  the  asymptotic  point  of  view.  The  first  of 
these  objections  does  not  apply  to  our  result  at  sill  but  the  second  one 
certainly  does:  the  numerical  values  of  e are  very  small.  (One  could 

improve  on  them  by  talcing  a little  more  care  in  the  computations  but 
even  then  they  probably  would  not  be  very  impressive.) 

In  Section  2,  we  point  out  those  properties  of  random  graphs  which 
appear  in  the  proof  of  the  main  result:  looking  at  small  subgraphs  of  G , 

and  then  extrapolating  in  a straightforward  way,  one  would  expect  a(G)  to 
be  much  larger  than  it  actually  is.  In  that  sense,  a(G)  is  very  much  a 
"global  parameter".  And  it  is  precisely  this  global  character  which  makes 
the  proofs  of  (1.1)  so  long.  In  Section  3,  we  describe  a certain  class  of 
crude  algorithms  for  evaluating  a(G)  and  then  touch  briefly  upon  the 
more  sophisticated  algorithm  of  Tarjan  and  Trojanowski.  That  section 
provides  the  motivation  for  the  definition  of  a recursive  proof  presented 
in  Section  4.  The  exponential  that  appears  in  our  main  result  originates 
from  an  upper  bound  on  the  tail  of  the  hypergeometric  distribution;  it 
finds  its  way  into  the  theorem  via  a lemma  on  binary  trees  which  we  set 
aside  in  Section  5* 
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In  the  context  of  another  NP-complete  problem  (namely,  that  of 
satisfiability  of  Boolean  expressions),  there  are  many  results  similar 
in  spirit  to  ours;  most  of  them  can  be  found  in  [3].  In  particular,  the 
proof  system  investigated  recently  by  Galil  [11]  is  very  much  like  ours; 
however,  the  similarity  does  not  extend  beyond  the  superficial  level. 


U 


2.  Random  Graphs. 


In  this  section,  we  shall  deal  with  graphs  whose  vertices  are  labeled 

as  v,  ,v„,  ...,v  . Two  such  graphs  may  be  distinct  even  if  they  are 

i d n 

isomorphic;  hence  their  total  number  is  2n^n  ^ ^ . If  P is  a property 
which  a graph  may  or  may  not  have  then  we  shall  denote  by  t(p,n)  the 
number  of  those  graphs  with  n vertices  which  do  have  the  property. 
Finally,  we  shall  say  that  almost  all  graphs  have  the  property  P if  the 
ratio  t(P,n)/2n^n_1^'/2  tends  to  one  as  n tends  to  infinity.  A typical 
statement  of  this  kind  appears  in  the  following  lemma.  The  lemma  itself 
seems  to  be  a part  of  the  graph-theoretical  folklore.  It  appears  at  least 
implicitly  in  a I9U7  paper  by  Erdtts  [5];  further  refinements  can  be  found 
in  works  of  Matula  [17],  Grimmett  and  McDiarmid  [12],  ErdBs  and  Bollobas 
[81  and  perhaps  others. 


Lemma  2.1.  Almost  all  graphs  G of  order  n have  the  property  that 
a(G)  < 21og  n / log  2 . 


Proof.  Denote  2 log  n / log  2 , rounded  up  to  the  nearest  integer,  by 
k(n)  . Clearly,  the  number  of  those  graphs  of  order  n for  which  a > k , 
divided  by  the  number  of  all  graphs  of  order  n , does  not  exceed 


( J )2-k(k-1>A  . 


By  elementary  estimations,  (2.1)  is  at  most 


(2.1) 


g-(k-i)/2  jk 


For  all  sufficiently  large  n , we  have 


(2.2) 


22.  = e21/2/k  < .99 


and  so  (2.1)  tends  to  zero  as  n tends  to  infinity. 


In  the  theory  of  random  graphs  developed  by  Erdtts  and  Renyi  [8], 
[9)>  [10],  one  investigates  graphs  with  n vertices  and  m edges. 
Clearly,  the  number  of  such  graphs  is 


We  shall  denote  by  t(P,n,m)  the  number  of  those  graphs  with  n vertices 

and  m edges  which  have  some  property  P . If  m is  a function  of  n 

such  that  each  m(n)  is  a nonnegative  integer  not  exceeding  n(n-l)/2 

and  if  the  ratio  of  t(P,n,m)  to  (2.3)  tends  to  one  as  n tends  to 

infinity  then  we  shall  say  that  almost  all  graphs  with  n vertices  and  m 

edges  have  the  property  P . The  following  lemma  has  been  used  by  Erd8s  in 

[t>]  and  elsewhere.  (Throughout  the  paper,  log  denotes  the  natural  logarithm.) 


Lemma  2.2.  If  m(n)  > l6n  for  all  sufficiently  large  n then  almost 
all  graphs  G with  n vertices  and  m edges  have  the  property  that 


a(G) 


, n , m 
< — log  - 
m n 


(2.4) 


Proof.  Denote  the  right-hand  side  of  (2.4),  rounded  up  to  the  nearest 
integer,  by  k(n)  ; note  that  k(n)  — <*>  as  n -•  ® . Clearly,  the  number 
of  those  graphs  with  n vertices  and  m edges  for  which  a > k , divided 
by  the  number  of  all  graphs  with  n vertices  and  m edges,  does  not  exceed 


By  elementary  estimations,  (2.5)  does  not  exceed 
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< (f  *4-^))k 


In  addition,  we  have 

en  f m(k-l)  \ em  f . m , m \ 

k eXp(  " n(n-l)  ) n log(m/n)  eXp^  °g  n ^2  j 

iince  the  last  c lantity  becomes  smaller  than  .99  for  all  sufficiently 
large  n , we  conclude  that  (2.5)  tends  to  zero  as  n tends  to  infinity,  u 

Next,  let  us  digress  a little.  When  m , n , s are  nonnegative  integers 

such  that  m < n and  when  t is  a positive  real  number,  we  shall  set 

* 

p = m/n  , denote  by  £ the  summation  over  all  integers  j > s(p+t)  and 
define 


Thus  B is  the  familiar  "tail  of  the  binomial  distribution"  and  H is  the 
"tall  of  the  hypergeometric  distribution".  The  well-known  interpretation 
of  these  quantities  goes  as  follows.  Imagine  a barrel  containing  n apples, 
exactly  m of  which  are  rotten;  take  a random  sample  of  s apples. 
Technically,  the  sampling  can  be  done  in  at  least  two  ways.  We  might  pick 
and  examine  the  apples  one  by  one,  each  time  throwing  the  apple  back  into 
the  barrel  before  reaching  in  again:  this  is  called  sampling  with 

replacement.  Or  we  might  just  grab  the  s apples  at  the  same  time: 
that  is  called  sampling  without  replacement.  Whichever  method  we  use,  we 
should  expect  about  ps  rotten  apples  in  the  sample.  The  quantities  B 
and  H give  the  probability  that  at  least  (p+t)s  rotten  apples  will 
appear  in  the  sample  with  and  without  replacement,  respectively. 
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An  elegant  argument  (apparently  due  to  S.  N.  Bernstein)  shows  that 


B(m,n,s,t)  < ((^)  (]^t)  P j • 

A similar  bound  for  H seems  to  be  far  more  difficult  to  establish. 

A special  case  of  a theorem  of  Hoeffding  ([13],  Theorem  U)  states  that 

H(m,n,s,t)  < ((^t)  ) ' (2-6) 

It  is  a routine  matter  to  convert  (2.6)  into  weaker  but  more  tractable 
bounds;  we  are  about  to  do  that  for  t = p . 

Lemma  2.3.  H(m,n,s,m/n)  < e'”13/1*11  . 

Proof.  If  p > 1/2  then  the  left-hand  side  vanishes.  If  p < 1/2  then 
(2.6)  implies 

| log  H(m,n,  s,p)  < 2 p log  | + (l-2p)  log  ( 1 + j 

< 2p  log  i + p < -p/1* 

which  is  the  desired  conclusion.  □ 


Upper  bounds  on  H are  useful  in  proving  statements  about  random 
graphs,  such  as  the  following  one. 

Lenina  2.4.  Almost  aid  graphs  G with  n vertices  and  m edges  have  the 
following  property:  every  subgraph  of  G induced  by  s vertices  such  that 
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Un  , m 
s > log  - 

— m n 

2 2 

has  fewer  than  2ms  /n  edges. 


(2.7) 


Proof.  Clearly,  the  number  of  those  graphs  which  do  not  have  the  property, 
divided  by  the  number  of  ail  graphs  with  n vertices  and  m edges,  does 


not  exceed 


T ( s )«  (_  ( | ) , (|),m,  (|)/(“)  ) . 


(2.8) 


By  Lemma  2.3,  this  quantity  does  not  exceed 


/ n \ ( ms(s-l)  ^ f en  f 

(s}  ex*v K - (irexv 


m(s-l) 

i 2 


By  (2.7 )>  we  have 


en  f m(s-l 


^ em  m , m \ 

- 1+n  log(m/n)  ^2  °g  n J ' ^ 


Hence  (2.8),  being  bounded  from  above  by 


(.99)s  < 100(.99)1+n  log(m/n)/m 


tends  to  zero  as  n tends  to  infinity.  U 
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3. 


Algorithms. 


In  this  section,  we  shall  first  describe  a class  of  crude  algorithms 
for  finding  a largest  stable  set  in  a graph  and  point  out  that  by  the  use 
of  appropriate  data  structures,  the  running  time  of  these  algorithms  can 
be  cut  down  considerably.  Then  we  shall  briefly  outline  a class  of  more 
sophisticated  algorithms  which  we  shall  call  Tarjan  algorithms. 

Let  us  suppose  that,  given  a graph  G = (V,  E)  and  a subset  S of  V , 

we  wish  to  find  a largest  stable  subset  A of  S . We  may  begin  by 

choosing  a vertex  veS  ; the  desired  set  A either  does  not  contain  v 
or  it  does  contain  v . In  the  first  case,  A is  a largest  stable  subset 
of  the  set  = S-{v}  ; in  the  second  case,  A-fv}  is  the  largest  stable 
subset  of  the  set  Sg  obtained  from  S by  deleting  v with  all  of  its 
neighbors  in  S . We  shall  denote  by  S-v  and  Sg  by  S*v  ; with 

this  notation,  we  have 

a(S)  = max(a:(S-v)  , 1 +a(s*v) ) 

Thus  we  have  reduced  the  original  problem  into  two  similar,  but  smaller, 
subproblems:  one  for  S-v  and  the  other  for  S*v  . 

Now,  an  algorithm  for  finding  a largest  stable  set  in  G suggests 

itself : begin  with  S = V , do  what  we  have  just  done  and  then  simply 

iterate  away.  One  may  visualize  a binary  tree  with  nodes  labeled  by 
subsets  of  V . The  root  is  labeled  by  V itself;  if  a node  is  labeled 
by  a nonempty  set  S then  its  left  son  is  labeled  by  S-v  and  its  right 
son  is  labeled  by  S*v  for  some  veS  . If  G has  n vertices  altogether 
and  if  each  vertex  has  fewer  than  d neighbors  then  the  tree  will  have  at 
least  2n^d  nodes.  Of  course,  that  does  not  mean  that  the  algorithm  will 
create  at  least  2°^  subproblems:  different  nodes  of  the  tree  may  have 
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the  same  label.  (To  take  an  extreme  example,  note  that  all  the  leaves 
of  the  tree  will  be  labeled  by  <f>  . ) 

We  shall  describe  a possible  implementation  of  the  algorithm.  For 
definiteness,  let  us  assume  that  we  have  a fixed  "choice  function"  f 
which  assigns  to  each  nonempty  subset  S of  V a vertex  f(S)  e S . 

Such  a function  gives  rise  to  an  algorithm  which  we  shall  call  the 
f -driven  algorithm. 

In  its  first  phase,  the  algorithm  creates  a list  of  certain  subsets 
of  V , which  will  be  called  subproblems . It  will  be  convenient  to  keep 
the  list  ordered,  with  larger  subproblems  preceding  the  smaller  ones; 
within  each  group  of  subproblems  of  the  same  size,  the  order  may  be 
lexicographic.  At  each  moment,  we  shall  have  a partial  list  of  subproblems, 
with  a pointer  at  one  of  them.  At  the  very  beginning,  V will  be  the 
only  subproblem  on  the  list;  the  first  phase  will  terminate  as  soon  as 
the  pointer  gets  to  0 . When  the  pointer  is  at  a nonempty  set  S , 
we  define  = S-f(s)  and  = S*f(S)  . Then  we  add  and  on 

the  list  (unless  they  are  already  present),  shift  the  pointer  to  the 
successor  of  S and  iterate. 

In  the  second  phase,  we  pass  through  the  list  in  a reverse  order 
(from  <p  to  V ) and  evaluate  a(G)  for  each  subproblem  S . To  begin 
with,  we  have  a(0)  = 0 ; for  each  nonempty  subproblem  S , we  have 
a(S)  = max(a(G1)  , 1 +a(S2))  . 

In  the  third  phase,  we  shall  find  a largest  stable  set  A in  G . 

To  begin  with,  let  us  set  A = JZ)  and  S = V . With  each  iteration,  the 
set  S will  shrink;  when  it  will  become  empty,  A will  be  the  desired 
largest  stable  set  in  G . Each  iteration  is  simple.  If  a(S)  = 
then  we  replace  S by  ; otherwise  a(s)  = l+a(S2)  in  which  case  we 
add  f(S)  to  A and  replace  S by  S0  . 


n 


It  is  crucial  to  use  the  appropriate  data  structures  when  i'  ■piemen ting 


the  first  phase.  Trivially,  the  number  of  subproblems  on  the  list  never 
exceeds  2n  . If  we  implement  the  list  as  a balanced  tree  (see  [15]  or  [1]) 
then  each  of  the  look-ups  and  insertions  can  be  handled  within  a number 
of  set-comparisons  proportional  to  n . If  each  f(S)  can  be  evaluated 
within  a steps  and  if  the  total  number  of  subproblems  is  b then  the 
running  time  of  the  algorithm  is  O(abn^)  . For  at  least  a few  choices 
of  f that  come  to  mind,  a is  polynomial  in  n . In  that  case,  b 
threatens  to  be  the  decisive  factor  in  the  upper  bound. 

Needless  to  say,  the  number  of  subproblems  depends  on  the  choice 
function  f ; for  most  functions  f , that  number  seems  difficult  to 
estimate.  To  simplify  the  situation,  we  shall  restrict  ourselves  to 
very  special  choice  functions:  when  the  vertices  of  G are  ordered  as 

Vf,  Vg,  functi°n  f chooses  that  vertex  of  S which  has  the 

smallest  subscript.  The  resulting  f-driven  algorithm  will  be  called  an 
order-driven  algorithm. 

The  following  proposition  and  its  corollaries  (Propositions  3.2 -3.5) 
are  due  to  Szemeredi.  In  its  statement,  N(k)  denotes  the  number  of 
stable  subsets  of  [v^, v^, . . ., v,}  . Here  and  later  on,  we  shall  find  it 
convenient  to  denote  by  S*T  the  subset  of  S resulting  when  all  the 
vertices  in  T and  all  their  neighbors  are  deleted. 

Proposition  3«1«  The  order- driven  algorithm  applied  to  a graph  with 

vertices  v^, vn, . . ., vn  creates  at  most 

n-l  . , 

1 + Z)  min(N(k)  , 2n~  ” ) 
k = 0 

subproblems . 
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Proof.  For  each  subproblem 


let  k be  the  largest  subscript 


such  that  [v1,v0,...,vkJnS  = fi  . It  is  not  difficult  to  see  that 
S = lvktl»vkf2'"*'vn)*B 

for  some  stable  subset  B of  {v^, v2» • • •> v^}  . Hence  for  each  fixed  k , 
there  are  at  most  N(k)  subproblems  S . In  addition,  if  k < n then 
there  are  only  2n-k_1  subsets  S of  [vk+1, . . vn}  such  that  v.+.  e S . □ 


Proposition  3,2.  The  order-driven  algorithm  applied  to  a graph  G of 
order  n such  that  a(G)  < n/2  creates  at  most 
2,  n \ 

" <a(0)> 

subproblems . 


Proof.  Trivially,  we  have 


“<k)  - i?o  5 "‘“(V 


for  each  k ; the  rest  follows  from  Proposition  3«1» 


Proposition  3.3.  For  almost  all  graphs  G of  order  n , the  order- 
driven  algorithm  creates  at  most 


2(1  + log  n/log  2) 


subproblems . 


(The  proof  follows  immediately  from  Proposition  3*2  and  Lemma  2.1.) 


Proposition  3«^«  If  m(n)/n  -•  ® then  almost  all  graphs  G with  n 
vertices  and  m edges  have  the  following  property:  for  every  constant 

c > 1 , the  order-driven  algorithm  on  G creates  o(cn)  subproblems. 
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I 'roof . 


By  Lemma  2.2,  we  have  a(G)  = o(n)  for  almost  all  graphs  with 


n vertices  and  m edges;  the  rest  follows  from  Proposition  3.2.  u 

Proposition  3.5.  For  every  graph  with  n vertices,  the  order-driven 
algorithm  creates  at  most  3*2^n  ^ -1  subproblems. 


ir oof.  We  have 


T.  min(N(k)  , 2n  k ^)  < T min(2k , 2n-k-1)  < 3*2 
k = 0 k = 0 


(n-l)/2 


-2  ; 


the  rest  follows  from  Proposition  3.1.  □ 


Note  that  the  bound  of  Proposition  3.5  is  sharp:  it  is  attained 

by  the  graph  with  vertices  vp>  vp>  • • •>  v0mfp  and  edges 
vlV2m+l’V2V2m-..»VmV2  * Nevertheless,  if  we  can  choose  the  ordering 
of  the  vertices  then  the  bound  can  be  improved. 


Proposition  3.6.  Every  graph  with  n vertices  can  be  ordered  in  such 
a way  that  the  order-driven  algorithm  creates  0(n2^n^)  subproblems. 


Proof.  We  shall  first  describe  the  ordering  and  then  we  shall  show  that 
it  has  the  desired  property.  Suppose  that  we  have  already  constructed 
the  initial  segment  v^v^,..^^  for  some  t > 0 . If  the  graph 
H = G-{v1, v2,  contains  a path  w^w^w^w^  then  we  set  = wi 

for  1 < i < and  iterate.  Otherwise  each  component  of  H is  a star 
or  a triangle.  In  that  case,  we  denote  4t  by  m and  enumerate  the 
vertices  of  H as  vm+1,  v^g,  * • •>  vn  in  such  a way  that 

(i)  the  vertices  of  each  component  of  order  j are  enumerated  as 
vi+l>  Vi+2J  * * vi+ j for  come  i> 

(ii)  if  that  component  is  a star  then  vi+1  is  its  center. 
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It  is  not  difficult  to  verify  that  N(k)  < 2^k  f0r  each 

k * 1,2,  . . . ,m  . If  m > Un/7  then 

S min(N(k)  , 2n"k_1)  = 0(n25n/?)  . 

k = 0 

If  m < Un/7  then  we  resort  to  another  argument:  note  that  each 

subproblem  has  the  form  {v^+^,  v^+oj  • ♦ •>  vn]*B  such  that  1 < k < n and 
B is  a stable  subset  of  {v^, Vg, . ••>vm)  • Since  N(m)  < , 

the  total  number  of  subproblems  is  0(n2^n^)  . □ 

It  is  not  unlikely  that  the  bound  of  Proposition  3*6  can  be  improved. 

Let  us  call  a number  c admissible  if  every  graph  with  n vertices  can 

be  ordered  in  such  a way  that  the  order-driven  algorithm  creates  0(cn) 

subproblems;  let  cQ  denote  the  infimum  of  all  admissible  c . By 

3 h 

Proposition  3.6,  we  have  cQ  < 2^'  ; on  the  other  hand,  the  main  result 

of  this  paper  implies  that  Cq  > 1 . What  is  the  exact  value  of  c^  ? 
Similar  questions  apply  to  the  wider  class  of  f-driven  algorithms  and 
to  the  even  wider  class  of  Tar.jan  algorithms  which  we  are  about  to  outline. 

As  pointed  out  at  the  beginning  of  this  section,  every  f-driven 
algorithm  applied  to  a graph  gives  rise  to  a binary  tree  whose  nodes  are 
labeled  by  subproblems:  if  a node  x is  labeled  by  a nonempty  subproblem 
S then  the  left  son  of  x is  labeled  by  S-v  and  the  right  son  of  x 
is  labeled  by  S*v  for  some  veS  . Elimination  of  duplications  on  the 
list  of  subproblems  amounts  to  pruning  the  tree;  we  simply  omit  nodes 
whose  presence  would  result  in  duplicated  labels.  The  idea  of  Tarjan  [19] 
leads  to  pruning  of  a different  kind.  In  an  f-driven  algorithm,  each 
subproblem  S is  generated  in  the  form  (V-A)*B  such  that  B is  a stable 
set;  eventually,  such  a subproblem  yields  a stable  set  of  size  a(S)+|B|  . 
If  another  subproblem  is  generated  in  the  form  (V-A^)*B^  such  that 
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S^c  S and  |B^|  < |b|  then  can  be  discarded:  in  a sense,  S^  is 

dominated  by  S . In  terms  of  the  binary  tree,  we  might  index  each  node  x 
by  the  number  r of  right-hand  turns  on  the  path  from  the  root  to  x ; 
a branch  rooted  at  a node  x^  (labeled  by  and  indexed  by  r^  ) 

may  be  pruned  off  whenever  there  is  another  node  x (labeled  by  S and 
indexed  by  r ) such  that  S-^  c S and  r^  < r . 

Now  we  have  arrived  at  two  kinds  of  pruning:  these  might  be  called 

"duplication  pruning"  and  "dominance  pruning",  the  former  being  (in  a sense) 
a special  case  of  the  latter.  An  f-driven  algorithm  with  the  option  of 
using  both  duplication  pruning  and  dominance  pruning  to  eliminate  subproblems 
will  be  called  a Tarjan  algorithm.  Of  course,  systematic  use  of  dominance 
pruning  may  shorten  the  list  of  subproblems  quite  considerably.  In  terms 
of  running  time,  however,  the  means  could  defeat  the  purpose:  in  general, 

it  may  take  a very  long  time  to  decide  whether  the  subproblem  that  has 
been  just  created  is  dominated  by  at  least  one  of  the  subproblems  already 
on  the  list.  Thus  it  may  be  wise  to  pass  up  the  option  of  (possible) 
dominance  pruning  in  most  cases,  resorting  to  it  only  in  those  simple 
situations  where  the  dominating  subproblem  is  almost  staring  at  us.  Such 
a strategy  led  Tarjan  [19]  to  an  algorithm  whose  worst-case  running  time 
for  a graph  with  n vertices  is  0(l.286n)  . Later  on,  Tarjan  and 
Trojanowski  [20]  designed  an  improved  version  of  that  algorithm  with 
running  time  0(2  ' J)  . It  may  be  worth  pointing  out  that  these  upper 
bounds  cone  out  of  rather  rudimentary  applications  of  dominance  pruning 
only:  the  argument  does  not  take  duplication  pruning  into  account  at  all. 

Thus,  it  is  not  inconceivable  that  (with  the  subproblems  kept  in  a balanced 
tree,  so  that  duplication  pruning  is  easy  to  implement)  the  worst-case 
running  time  of  the  Tarjan -Trojanowski  algorithm  is  even  better  than 
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0(2n/3)  , Nevertheless,  the  main  result  of  this  paper  implies  the 
existence  of  a constant  c > 1 and  arbitrarily  large  graphs  G with 
n vertices  such  that  every  Tarjan  algorithm  applied  to  G must  create 
at  least  c11  different  subproblems.  (in  fact,  almost  all  graphs  with  n 
vertices  and  dn  edges  have  this  property  as  long  as  d is  sufficiently 
large. ) 

One  more  comment:  from  the  practical  point  of  view,  the  Tarjan  - 

Trojanowski  algorithm  might  be  preferable  even  to  (hypothetical)  f-driven 
algorithms  creating  cn  subproblems  for  c fairly  close  to  1 . The 
point  is  that  the  space  requirements  of  such  algorithms  would  be  roughly 
nc11  whereas  the  space  required  by  the  Tar j an  - Trojanowski  algorithm  is 
only  polynomial  in  n . 
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4*1  Recursive  Proofs. 


For  the  moment,  let  us  deal  with  an  arbitrary  but  fixed  graph 
G = (V,  E)  . By  a statement,  we  shall  mean  an  ordered  pair  (S,t)  such 
that  S is  a subset  of  V and  t is  a nonnegative  integer.  (Such  a 
statement  is  to  be  interpreted  as  the  inequality  a(S)  < t which,  of 
course,  may  be  true  or  false.)  By  a recursive  proof  of  a statement 
(S,  t)  over  G , we  shall  mean  a sequence  of  statements 

(5^,t^)  , i = 0, 1, . . . , ra  (4.1) 

such  that  (S0,t0)  = ft  , = (S,  t)  and  such  that  each  statement 

(S^, t^)  with  k > 1 can  be  derived  from  the  previous  statements  (s^t^)  , 
0 < i < k , by  at  least  one  of  the  following  two  rules. 

1.  The  dichotomy  rule:  from  (S-v, t^)  and  (S*v, t^.)  we  can  derive 

(S  , max(t^,  l+t j ) ) . 

2.  The  monotone  rule:  from  (S, t)  we  can  derive  (S',t')  whenever 
S'  c S and  t'  > t . 

Clearly,  if  (4.1)  is  a recursive  proof  of  (S, t)  then  a(S^)  < t^ 
for  every  i ; in  particular,  a(s)  < t . Conversely,  if  a(S)  < t then 
there  is  a recursive  proof  of  (S, t)  . In  order  to  see  that,  consider  the 
family  F of  subproblems  created  by  some  f-driven  algorithm  that  has 
just  found  a largest  stable  subset  of  S . Enumerate  all  the  ordered  pairs 
(S*>ct(S*))  with  S*cF  as  (4.1)  in  such  a way  that  |S^|  < |S^+1|  for 
every  i . Clearly,  the  resulting  sequence  constitutes  a recursive  proof 
of  (S,a(s))  ; if  t > a(S)  then  one  additional  application  of  the 
monotone  rule  completes  a recursive  proof  of  (S, t)  . 

It  will  be  convenient  to  define  the  length  of  (4.1)  as  m . Now, 
Propositions  3*1  -5*6  yield  direct  corollaries  in  terms  of  recursive 
proofs.  We  shall  state  explicitly  only  one  of  them. 
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then,  for  almost  all 


Proposition  1.1.  If  c > 1 and  if  m(n)/n  — ^ 

graphs  G = (V, E)  with  n vertices  and  m edges,  there  are  recursive 
proofs  of  (V,cc(G))  of  1 ngth  o(cn)  . 

In  addition,  eve  'an  algorithm  applied  to  G = (V, E)  yields 

a recursive  proof  of  (V,a(G))  . Hence  for  every  graph  G = (V,  E)  of 
order  n , there  is  a recursive  proof  of  (V,a(G))  of  length  0(2n^')  . 

Now,  we  shall  show  that  for  a certain  class  of  graphs  G = (V,  E)  , 
there  exist  very  short  recursive  proofs  of  (V,o:(g))  . This  class 
consists  of  all  those  graphs  G for  which  a(G)  equals  9(G)  , the 
smallest,  number  of  cliques  whose  union  is  V . (Trivially,  we  have 
a(G)  < 9(g)  for  every  graph  G .)  It  may  be  instructive  to  split  the 
argument  into  three  easy  propositions. 

proposition  4.2.  If  G = (V,  E)  is  a complete  graph  of  order  n then 
there  is  a recursive  proof  of  (V, l)  whose  length  is  n . 

Proof.  En  aerating  the  vertices  of  G as  v^, v0, . . ., v^  , define 

Si  = lvi^ v2’ • * Trivially,  the  sequence  (0, 0), (S^ 1), . . . , (Sn, 1) 
constitutes  a recursive  proof.  □ 

Proposition  U.%  Let  G^  = (V^, E^)  and  G^  = (V^, E0)  be  graphs  such 

that  OVp  = 0 ; let  G-^UGg  denote  the  graph  (V^  |JV2  , U EL)  . If 

there  are  recursive  proofs  of  (V  ,a(G.))  of  length  m for  each 

0 J j 

j = 1,2  then  there  is  a recursive  proof  of  (V-^  U V0  , a(G^ ) + a(G? ) ) whose 
length  does  not  exceed  irij+np,  . 

Proof.  If  (S^,t^)  with  i = 0,1,  ...,m  is  a recursive  proof  of 
(V?,a(G2))  then  a recursive  proof  of  (V]L,a(Gi))  , followed  by  the  sequence 
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( V U , Qi(G^ )+t^)  , i - 1)2,  , nig 

constitutes  a recursive  proof  of  (V-^UVg  , a(G^)  + a(Gg))  . i_i 

proposition  U.U.  Let  F be  a subgraph  of  G and  let  (H.l)  be  a recursive 
proof  over  F . Then  there  is  a recursive  proof  of  (S  , t ) over  G 
whose  length  does  not  exceed  2m  . 


Froof . We  shall  create  the  desired  proof  over  G from  (k.l)  by  inserting 

a new  statement  immediately  before  each  that  has  been  obtained 

from  the  previous  statements  by  the  dichotomy  rule.  For  every  such  (S^, t^)  , 

there  are  subscripts  i , j and  a vertex  veS^  such  that  i < k , j < k , 

t,_  = max(t_.  , 1+t,)  and  S.  = S,_-v  , = S,_*v  in  F . The  statement  to 

K 1 J 1 K J K 

be  inserted  immediately  before  (S£  , t^-l)  such  that 

3k  = Sk*v  in  G . Clearly,  (S^,  t^-1)  follows  from  (S^,tj)  by  the 
monotone  rule  whereas  ( S,  , t^ ) follows  from  (S^,t^)  and  (S^  , t^-1) 
by  the  dichotomy  rule.  □ 


Proposition  U.5.  For  every  graph  G = (V, E)  of  order  n there  is  a 
recursive  proof  of  (V  , 0(G))  whose  length  does  not  exceed  2n  . 

Proof.  Consider  the  subgraph  F of  G consisting  of  0(G)  cliques 
whose  union  equals  V . By  Proposition  h.2  and  by  repeated  applications 
of  Proposition  h.J,  there  is  a recursive  proof  of  (V  , 0(G))  over  F 
whose  length  equals  n . The  rest  follows  from  Proposition  I4.I4.  □ 

We  shall  close  this  section  with  another  easy  observation  which  will 
be  handy  later.  The  proof  can  be  left  to  the  reader. 
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Proposition  U.>  . If  (^.1)  is  a recursive  proof  over  G = (V, E)  and 
if  W c V then 

(S.  nw  , t.)  , i = 0,1,  ...,m 

is  a recursive  proof  over  the  subgraph  of  G induced  by  W . 
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A Lemma  on  Binary  Trees. 

Let  a , b , r , s be  nonnegative  integers,  A binary  tree  whose  nodes 
are  colored  red  and  blue  will  be  called  (a,  b, r, s)  -constrained  if,  along 
each  path  from  the  root  to  a leaf, 

exactly  a nodes  are  followed  by  their  left  sons  and  exactly  b 
nodes  are  followed  by  their  right  sons, 

(ii)  at  most  r nodes  are  red, 

( i i i > at  least  s red  nodes  are  followed  by  their  right  sons. 

If,  for  sane  choice  of  integers  a , b , r and  s , there  is  at  least 
one  (a, t,r, s)  -constrained  tree  then  we  denote  by  f(a, b,r,  s)  the  largest 
possible  number  of  leaves  in  such  a tree;  otherwise  we  set  f(a,b,r, s)  = 0 . 
Trivially,  we  have 

f (a, b,  r,  s)  < ( a^b  ) 

and 

f(a, b, r, s)  = 0 whenever  s > b or  s > r , 

The  purpose  of  this  section  is  to  derive  the  following  upper  bound  on 
f(a,b,r,s)  , 


Lemma  5.1.  If  s > 2br/(a+b-l)  , s > r-a+1  and  s > 1 then 

. , a+b  N ab  -br/U(a+b) 
f(a,b,r,  s)  < ( b ) _ e 


First  of  all,  we  shall  establish  a simple  recursive  bound. 

f(a-l,b,r,s)  + f(a,b-l,r,s) 


Fact  5.2. 


f(a, b, r, s)  < max 


f(a-l,b,r-l, s)  + f(a,b-l,r-l,  s-1) 


Proof.  Let  T be  an  (a, b,r,  s)  -constrained  tree.  If  its  root  is  blue 
then  the  left  subtree  is  either  empty  or  (a-l,b, r, s)  -constrained  and  the 
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right  subtree  is  either  empty  or  (a,b-l, r, s)  -constrained.  If  the  root 
is  red  then  the  left  subtree  of  T is  either  empty  or  (a-l,b, r-1,  s) 
-constrained  and  the  right  subtree  of  T is  either  empty  or  (a, b-1, r-1, s-1) 
-constrained.  Hence  the  desired  conclusion.  □ 


Next,  for  every  choice  of  nonnegative  integers  a , b , r , s such  that 
s<b  , s < r , s > r-a+1 

we  define 

F(a,b,r,s)  = £ ( s+i)(  ) • 

1=0 

It  is  easy  to  verify  that 

F(a,b,r,0)  = (a*b)  , 


F(a,b,r,b)  = (,  ) , 


b 

a+b-r 


F(a,b,r,r)  = ( ) , 


F(a,  b,r,  r-a+1)  = ( &*b  ) - ( £ ) , 

F(a-l,b, r, s)  + F(a,b-1, r,  s)  = F(a,b,r,s)  , 
F(a-1, b, r-1, s ) + F(a, b-1,  r-1, s-1)  = F(a,b,  r,  s) 


whenever  the  left-hand  side  terms  are  defined. 


Fact  5«3-  We  have  f(a, b,  r,  s)  < F(a, b,  r,  s)  whenever  the  rigiit-hand  side 
is  defined. 

This  inequality  can  be  proved  by  induction  on  a+b  in  a straightforward 
way;  we  omit  the  tedious  details.  It  is  not  unlikely  that  there  is  a 
direct  combinatorial  proof  of  Fact  5.3»  Furthermore,  it  is  not  difficult 
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to  .-how  that  f(a,b,r,s)  = F(a,b,r,s)  whenever  the  right-hand  ride  is 
defined;  however,  that  ir,  irrelevant  for  our  purpose. 

Proof  of  Lemma  5.1.  We  may  assume  s < b and  s < r for  otherwise 
the  left-hand  side  vanishes.  Then,  by  Fact  5.3> 
f (a,b,  r,  s)  < F(a,b,r,  s)  . 

Since  r > s > 2br/(a+b-l)  , we  have  2b/(a+b-l)  < 1 and 

for  every  nonnegative  i • Hence*  with  "the  notation  of  Section  2* 

^ Si)(  a+b-sli_i)  - H(r+i,  a+b-1, b,  (r+i)/ (a+b-1) ) • ( &+^_1  ) 
By  Lemma  2.3,  we  have 

b“s  , a+b  „ a -br/4(a+b) 
f(a,b,r,s)  < Z ( b ) ^ e ' 

i = 0 

which  implies  the  desired  result. 
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The  Main  Result. 


A graph  G of  order  n will  be  called  (d,  e)  -sparse  if 

(i)  every  vertex  of  G has  degree  less  than  d , 

(ii)  every  subgraph  of  G induced  by  m vertices  such  that  m > en 

2 

has  fewer  than  dm  /n  edges. 


Theorem  6.1.  Let  n , t be  positive  integers  and  let  d , t be  positive 
reals  such  that 

n < lOtd  , 

n > 500t2/5  d , 


n > 100t5/U  d5/1*  , 


n > 2000t 


e < n2/l8l0t2  d2  •, 


let  G = (V, E)  be  a (d, £)  -sparse  graph  of  order  n . Then  every  recursive 
proof  of  (V, t)  has  length  at  least 


90d  n 

n~  exF  f 

20d 


(6.1) 


proof.  We  shall  set 

a = Ln5/1*5000t2d2j  , b = |_n2/900td2j 

and  show  that  every  recursive  proof  of  (V,  t)  has  length  at  least 
2 

a+b  exn  b 

"aF  GXp  WE 

The  reader  may  easily  verify  that  a > b > 200  and  so 

. 200  r?  200  n2 

201  U5000t2d2  * 201  900td2 

Then  it  follows  that  (6.2)  is  at  least  (6.1). 


(6.2) 
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Let  us  outline  our  strategy.  With  each  recursive  proof  of  (V,  t)  > 
we  shall  associate  a binary  tree  T whose  nodes  will  be  labeled  by 
statements  from  the  proof.  The  assignment  of  labels  to  nodes  will  not 
be  one-to-one  (to  take  an  extreme  example,  all  the  leaves  of  T will  be 
labeled  by  0 ) and  so  the  number  of  nodes  of  T may  be  much  greater 
than  the  length  of  the  proof.  We  shall  find  a node  z with  a certain 
convenient  property  and  then  we  shall  construct  a new  binary  tree  T*  . 
Even  though  T will  not  be  a subtree  of  T in  a strict  sense,  its  node; 
will  come  from  T ; in  particular,  z will  be  the  root  of  T*  . Finally, 
we  shall  show  that  within  the  set  N of  leaves  of  T*  , no  label  is 
repeated  too  often.  More  precisely,  for  each  subset  S of  V we  shall 
define 


N(s)  = {xeN:  x is  labeled  by  (S, t')  for  some  t'  } 


and  prove  that 


lN(S)|  5 |N|  exp 


(6.5) 


Since  N will  be  nonempty,  (6.5)  will  imply  the  desired  result:  indeed, 

the  number  of  those  sets  S for  which  N(S)  0 0 must  be  at  least  (6.2). 

Before  going  into  the  details,  the  reader  may  welcome  a preview 

of  the  idea  behind  the  proof  of  (6,3),  however  vague  such  a preview  may 

have  to  be.  Let  (W, t*)  be  the  statement  that  labels  z . In  the 

* 

absence  of  the  monotone  rule,  the  tree  T is  constructed  in  such  a 
way  that  every  subproblem  S labeling  a leaf  of  T*  is  obtained  from 
W by  simply  deleting  a vertices  and  by  deleting  b vertices  with 
their  neighbors.  If  we  had  our  way,  the  subgraph  H induced  by  W-S 
would  consist  of  a isolated  vertices  and  b disjoint  stars:  in  that 


26 


case,  we  could  reconstruct  the  two  sets  of  vertices,  proving  that 
|N(S) | = 1 . Actually,  we  shall  be  content  even  if  things  axe  not  all 
that  clear-cut,  as  long  as  we  can  approximat ely  reconstruct  the  two  sets. 
That  will  be  the  case  as  long  as  H is  reasonably  large,  (if  H is 
large  then  most  of  the  b vertices  must  have  large  degrees.  At  the  same 
time,  the  second  defining  property  of  a (d, e)  -sparse  graph  implies 
that  the  average  degree  in  H is  rather  small.  Hence  the  vertices  of 
large  degrees  are  quite  conspicuous. ) In  order  to  guarantee  that  H will 
be  large,  we  have  to  choose  z appropriately.  In  general,  the  rules 
for  constructing  T*  are  designed  to  neutralize  the  desultory  effects 
of  the  monotone  rule.  Now  that  the  poor  reader  is  properly  confused, 
we  can  proceed  to  the  details. 

Constructing  T , we  shall,  find  it  convenient  to  call  certain  statement 
in  the  proof  eligible:  a statement  will  be  called  eligible  if  it  is  (0,0) 

or  if  it  follows  from  some  two  earlier  statements  by  the  dichotomy  rule. 

Only  the  eligible  statements,  with  a possible  exception  of  (V, t)  , will  be 
used  to  label  the  nodes  of  T . The  construction  of  T is  recursive;  the 
root  of  T is  labeled  by  (V, t)  . Suppose  that  we  have  constructed  a node 
x labeled  by  a statement  and  having  no  sons  at  this  moment. 

If  = (0>°)  then  x will  be  a leaf  of  T . Otherwise  there  are 

eligible  statements  (S^t^)  , (Sj,t^)  and  a vertex  veS^  such  that 
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i,  j < k , Si  2 2k"v  > S.  2 sk#v  * ^ 2 , 1+t .)  . 

In  that  case,  we  shall  create  both  sons  of  x , label  the  left  one  by 

(S. , t. ) and  label  the  right  one  by  (S.,t.)  . For  further  reference, 

11  J J 

we  shall  set  S(x)  = , t(x)  = tR  and  v(x)  = v . It  will  be  useful 

to  note  that 

t.  < t.  and  t . < t,  -1  . ('■.!+) 

i — k J - k 

Next,  we  shall  find  the  special  node  z . A node  y will  be  called 
a descendant  of  a node  x if  there  is  a path  xq, x^, ...,x^  such  that 
x = xQ  , y = x^  and  each  xi+j_  is  a son  of  x^  . If,  in  addition, 
exactly  b nodes  x^  are  followed  by  their  right  sons  x^+^  then  y 
will  be  called  a b- descendant  of  x . Repeated  applications  of  (6.U)  show 
that 

if  y is  a b-descendant  of  x then  t(y)  < t(x)-b  . (6.5) 

We  claim  that 

there  is  a node  z such  that  S(z)  > n/2  and  such  that  \ 

> (6.6) 

l S(y ) | < |s(z)(  - bn/2t  for  every  b-descendant  y of  z . I 

A node  with  this  property  can  be  found  by  constructing  a certain  sequence 
y0,yi, . . . of  nodes  of  T such  that  y^  is  the  root  of  T . If  the  most 
recently  constructed  y^  has  a b-descendant  y such  that 
|S(y)|  > | S(yi ) | -bn/2t  then  set  yi+1  = y ; otherwise  stop.  By  (6.5)  and 
by  the  construction  of  the  sequence,  we  have 

1 S(y± ) | > n(l-bi/2t)  , t(y.)  < t-bi 

for  every  i . Since  t(y^)  > 0 , we  must  have  i < t/b  and  so 
|s(y.)|  > n/2  for  every  i . In  particular,  the  very  last  y^  in  the 
sequence  has  the  properties  required  of  z . We  shall  denote  S(z)  by  W . 
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With  each  descendant  x of  z , we  shall  associate  two  subsets 


A(x)  , B(x)  of  V : considering  the  path 

to  x^  = x we  shall  define 


A(x)  = 

{v(x.): 

0 

< i 

k 

and 

Xi+1 

B(x)  = 

lv(xi): 

0 

•H 

VI 

< k 

and 

Xi+1 

xQ,  x1, . . . , xk  from  xr.  = z 

is  the  left  son  of  x.  ] , 

1 J 

is  the  right  son  of  x.  } 


Clearly,  we  have 


|s(x)nw|  > | w|  - 1 (a(x)  - b(x) ) n w|  - L (i  + d(v))  (6.7 

vcB(x) 

for  every  descendant  x of  z . In  particular, 


if  |(A(x)  -B(x))  nw|  < a and  |b(x)|  < b then  S(x)  5/  0 : (6.8) 


just  observe  that 

a + b(l+d)  < 2a  + bd  < 3bd  < n/2  . 

Before  proceeding  to  construct  T*  , we  shall  associate  a node  x* 
with  each  descendant  x of  z such  that 

|(A(x)  - B(x) ) nw|  < a and  |b(x)|  < b . 

Consider  the  path  Xq,  x^,  . . . , such  that  x^  = x , x^  is  a leaf  of  T 

and  each  xi+j_  is  the  left  son  of  x^  . Note  that  B(x^)  = B(x)  for 

every  i . There  must  be  at  least  one  i such  that  0 < i < k and 

v(xi)eW  , v(x.)/A(x)UB(x)  ( .9) 

for  otherwise 

(A(xr)  -B^))  nw  = (A(x)  -B(x))  nw  and  B^)  = B(x) 

but  S(xk)  = 0 contradicting  (6.8).  We  shall  denote  the  first  x^ 
satisfying  (6.9)  by  x*  ; note  that 

(A(x* ) - B(x* ) ) 0 W = (A(x)  - B(x) ) PI  W 
At  last,  we  are  ready  to  construct  T*  . Each  of  its  nodes  x will 
come  from  T and  satisfy 
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v(x)  t W , v(x)  / A(x)  U B(x)  , 

|(A(x)  -B(x))  nw|  < a , |b(x)  | < b , B(x)  C W , 

# If 

'Hie  construction  of  T is  recursive;  the  root  of  T is  z . Suppose 

-if- 

that  we  have  already  constructed  some  node  x of  T ; let  x denote 
the  left  son  of  x in  T and  let  x^  denote  the  right  son  of  x in  T . 

If  |(A(x)  -B(x))fiw|  = a then  x will  have  no  left  son  in  T*  ; otherwise 
we  shall  make  x^  the  left  son  of  x in  T . If  | B (x ) | = b then  x 
will  have  no  right  son  in  T ; otherwise  we  shall  make  x^  the  right  son 
of  x in  T*  . It  will  be  useful  to  make  note  of  the  following  property 
of  T*  : 

along  each  path  from  the  root  to  a leaf, 
exactly  a nodes  are  followed  by  their  left  sons, 

(6.10) 

exactly  b nodes  are  followed  by  their  right  sons, 

and  these  a+b  nodes  x give  rise  to  distinct  vertices  v(x)  . 

Finally,  we  shall  prove  the  inequality  (6.3).  Without  loss  of 
generality,  we  may  assume  that  N(S)  ^ 0 and  so  S = S(y)  for  some  ycN  . 
Denote  by  H the  subgraph  of  G induced  by  W-S(y)  and  denote  by  m the 
order  of  H . Since  y is  a b-descendant  of  z in  T , (6,6)  implies  that 
m > bn/2t 

On  the  other  hand,  (6.7)  implies  that 

m < a + b(l+d)  < 2a  + bd  < 3hd  . 

Enumerate  the  vertices  of  H as  u^, Up,  ...,um  in  such  a way 

dH<ul>  ^ dH(u2)  ^ *•*  ^ W * 

■x  2 2 

Since  bn/2t  > u/l809tcd  and  since  G is  (d,  e)  -sparse,  the  graph  H 
2 

has  fewer  than  dm  /n  edges.  That  is, 
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£ du(u.  ) < 2iimt  /n 

i =1  ^ 1 

It  is  now  easy  to  see  that,  for  every  positive  integer  r , we  have 

2 

d^(ir)  < 2dm  /nr  whenever  i > r . 

We  shall  use  (6.11)  with 
r = |_am/^bdj 

Let  us  note  at  once  that  r > 200  and  so 
_ 200  am 

r > 20l  ‘ l^d 

It  will  be  also  useful  to  note  that 

2rbd  m 
a - 2 ’ 


2dm2b  _ 201  8b2d2 

nr  - 200  * an  * m 


f £2i  \2 

- V.  200  J ' 9 ’ 


r < a , 


2br  br  n 

a+b-1  - a - 1 


and  while  we  are  at  it,  let  us  also  verify  that 


201  bn 


201  m 


200  50t  200  25 


1 bn  m 

1000  2t  1000 


So  much  for  high-school  algebra.  Now,  we  shall  set  R = {u  L,  uu,, . . . , u j 
and  prove  that 

lB<x>  nBl  > ^71  (' 

for  every  xeN(S)  . To  begin  with,  (6.7),  (6.11)  and  p(x)  c W imply 
ra  < a + b + d(B(x)  f\Rl  +2dm2b/nr  . 
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If  (b.ll*)  failed  then  we  would  have 

2 

m < a + b + 2rbd/a  + 2dm  b/nr  . 

However,  the  right-hand  side  of  this  inequality  is  at  most 


( £01  . 

A + 

1 + 1 + 

f 201  ' 

\2 *  . M 

A 200 

25 

1000  2 

V_  200 

) 9j 

Hence  (6.1^)  must  hold. 

The  rest  is  easy.  Consider  the  subtree  of  T consisting  of  all 
the  paths  from  the  root  z to  leaves  in  N(S)  ; color  each  of  its  nodes 
x red  if  v(x)  eR  and  blue  otherwise.  By  (6.10)  and  (6.11+),  this  tree 
is  (a, b, r,2br/ (a+b-1) ) -constrained.  Because  of  (6.12)  and  (6.13), 
Lemma  5.1  applies  and  shows  that 

I«S)I  £ O • • 

By  virtue  of  (6.10),  this  is  the  desired  inequality  (6.3). 


Theorem  6.2.  Let  m be  a function  of  n such  that  m(n)  = o((n/log  n)  ) 


but  m(n)  > 1010n  for  all  sufficiently  large  n . Then,  for  almost  all 

graphs  G = (V, E)  with  n vertices  and  m edges,  every  recursive  proof 

of  (V,d(G))  has  length  at  least 

2 


j60m 


exp 


m ^ 350  log  2^- 


proof.  By  Lemma  2.2  and  by  Lemma  2.L,  almost  all  graphs  with  n vertices 


and  m edges  have  the  following  two  properties: 
2 

t 


(FI) 

(re) 


. n , m 

0(0)  < - loe  J 


every  subgraph  induced  by  s vertices  such  that 

Un?  . „ m 
s > — log  - 
— in  n 


2 2 

has  at  most  2ms  /n  edges. 
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We  shall  show  that  all  sufficiently  large  graphs  w'  h these  tv. 
properties  satisfy  the  conclusion  of  Theorem  6.2.  Let  us  define 


k(n) 

J • 

d(n) 

- Lkm/nr  , 

e(n) 

Un‘  ..  m 

= m — log  - 
km  ° n 

10 

Firstly,  we  shall  show  that  every  graph  with  n vertices  and  m _ 1 n 

edges  satisfying  (P2)  contains  an  induced  (d, e)  -sparse  subgraph  of 

order  k . To  begin  with,  2k(n)  < n . Next,  an  easy  averaging  argument 

shows  that  G contains  an  induced  subgraph  Hq  with  2k  vertices  and 
2 2 

at  most  tmk  /n  edges.  Beginning  with  HQ  , we  shall  construct  a sequence 
Hq, H-^, ...  of  induced  subgraphs  of  HQ  as  follows:  if  the  last  constructed 

H.  has  a vertex  v of  degree  at  least  d then  set  Hi+1  = H^-v  , otherwise 
stop.  Clearly,  if  an  gets  constructed  then  HQ  had  at  least  di 

iges  and  so  i < k . In  particular,  the  very  last  H.  in  the  sequence 
as  at  least  k vertices;  in  H . , we  shall  choose  an  induced  subgraph  H 

cl 

of  order  k . Let  W denote  the  set  of  vertices  of  H . By  (P2),  every 

£ 

subgraph  of  H with  s > ek  > tn  log(m/n)/m  vertices  has  at  most 
P P P 

2ms  /n  < ds  /k  edges.  Hence  H is  (d,  t)  -sparse. 

Next,  by  (Pi),  we  have 


a(G)  < ^ log  = 


On  the  other  hand,  we  have 

Ct{G)  > a(H)  > k/(d+l)  . 

For  all  sufficiently  large  n , Theorem  6.1  asserts  that  every  proof  of 
(W  , a(G))  has  length  at  least 


90d  k 300m  n2 

1 T eXp  ? > exP  —? ^TTT 

k 20dT  n mf  350  log  2 V 

By  Proposition  4.6,  this  is  also  a lower  bound  on  the  length  of  every 
recursive  proof  of  (V,a(G))  . □ 

Let  us  state  a special  case  of  Theorem  6.2  in  a compact  form. 

Theorem  6.3.  For  every  sufficiently  large  integer  d there  is  a 
constant  c > 1 with  the  following  property:  for  almost  all  graphs 

G = (V, E)  with  n vertices  and  dn  edges,  every  recursive  proof  of 
(V  , a(G))  has  length  at  least  cn  . 

In  closing,  two  remarks  may  be  in  order.  Firstly,  it  would  be  interesting 
to  construct  an  infinite  class  C,  of  graphs  for  which  there  is  a constant 
c > 1 with  the  following  property:  for  every  graph  G = (V, E)  in  Q,  and 

with  n vertices,  every  recursive  proof  of  (V,a(G))  has  length  at 
least  cn  . Secondly,  it  is  somewhat  frustrating  that  Theorem  6.2  does 

p 

not  apply  to  graphs  with  cn  edges.  Perhaps  the  following  is  true. 

Conjecture  6.1.  There  is  a positive  constant  c with  the  following 
property:  for  almost  all  graphs  G = (V, E)  with  n vertices,  every 

recursive  proof  of  (V,a(G))  has  length  at  least  n°  l0g  n . 
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Concluding  Remark.'. 


There  are  many  "natural"  proo!  systems  which  extend  our  system  of 
recursive  proofs;  we  shall  mention  just  a few.  It  would  be  interesting 
to  strengthen  our  results  by  proving  their  analogues  for  the  extended 
proof  systems. 

To  begin  with,  one  might  adjoin  the  following  inference  rule: 
from  (S1,t1)  and  (sp,t2)  we  can  deduce  (S^US,,  , t^  t-t^) 

Proposition  U.3  shows  that,  to  some  extent,  this  rule  is  implicit  in  the 

system  of  recursive  proofs.  Nevertheless,  its  addition  just  might  make 

the  system  considerably  more  powerful.  Along  this  line,  i'urther  extens'  . 

lead  to  the  system  of  cutting  plane  proofs  which  we  are  about  to  describe 

briefly.  Let  us  consider  a graph  G = (V,  E)  with  vertices  v^,  v0,  . . . , v() 

none  of  which  is  isolated.  A cutting  plane  proof  of  (V, t)  is  a sequence 

of  inequalities 
n 

Z a.  .x.  < b.  (i  = 1, 2,...,m) 

j=l  1J  J “ 1 

such  that 

(i)  all  the  numbers  a.  . and  b.  sire  nonnegative  integers, 

(ii)  for  every  k = 1,2,  ...,m  , either  the  k-th  inequality  reads 

x +x  <1  for  some  edge  v v or  else  there  are  nonnegative 
r s — r s 

multipliers  y^, Yp*  ' * '^k-l  such  that 
k-1 

Vij  i \) 

n 

(iii)  the  last  inequality  reads  Z x.  < t . 

j = i J 

It  is  not  difficult  to  see  that  a(G)  < t whenever  there  is  a cutting  plane 
proof  of  (V, t)  . The  converse  is  easy  as  well:  in  fact,  every  recursive 


k-1 

Z 

i =1 


yibi 


< bk 


Vj 


proof  of  (V, t)  can  be  converted  into  a catting  plane  proof  of  (V,  t)  . 

(The  details  are  left  to  the  reader.) 

An  inference  rule  which  strengthens  the  monotone  rule  and  is  not 
subsumed  in  the  notion  of  a cutting  plane  proof  goes  as  follows.  Let  us 
write  S-^  < if  there  is  a one-to-one  mapping  f:  -*  S0  such  that 

f(u)  and  f(v)  are  adjacent  only  if  u and  v are.  Clearly, 

from  (S,t)  we  can  derive  (S',t')  whenever  S'  < S and  t'  > t . 

Again,  it  would  be  interesting  to  find  out  whether  the  addition  of  this 
inference  rule  makes  the  system  of  recursive  proofs  considerably  stronger. 

Colin  McDiarmid  [18]  investigated  a proof  system,  similar  to  our  system 
of  recursive  proofs,  for  deriving  lower  bounds  on  the  chromatic  number  of 
graphs.  We  shall  describe  his  system  very  briefly.  Let  G be  a graph 
whose  vertices  are  labeled  by  nonempty  and  pairwise  disjoint  subsets  of 
{1, 2, ...,n}  ; let  u and  v be  two  vertices  of  G . We  shall  denote  by 
G'  the  graph  obtained  from  G by  adding  the  edge  uv  ; we  shall  denote  by 
G"  the  graph  obtained  from  G by  identifying  u with  v (in  which  case 
the  label  of  the  new  vertex  is  the  union  of  the  labels  of  u and  v ). 

As  usual,  iu(G)  denotes  the  order  of  the  largest  clique  in  G . By  a 
recursive  proof  of  [G^, t^]  , we  shall  mean  a sequence 

[G^,  tp J > i = 1, 2, . . .,m 

such  that,  for  each  k , either  t^  < uj(Gk)  or  else  there  are  subscripts 
i , j < k such  that  Gx  = G£  , G^  = G£  and  tR  = min(t.,tj)  . Clearly, 
if  there  is  a recursive  proof  of  [G„,t  ] then  v(G„)  > t . McDiarmid 
has  proved  that,  for  almost  all  graphs  with  n vertices,  every  recursive 
proof  of  [G,  x(q)1  haG  length  at  least 
exp(.157  n (log  n)1^  ) . 
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Hie  result  implies  that  the  average  running  time  the  •*  -t  ,1-lr- 
algorithm  for  finding  the  chromatic  number  of  a graph  [t]  grows  f-i.-i- 
than  every  exponential.  On  the  other  hand,  Lawler  [T  J has  designed 
algorithm  for  finding  the  chromatic  number  of  a graph  of  order  n w 
0(2.1t5  ) steps.  (Of  course,  these  facts  are  of  an  asyaj: to*  ie  tire  ir- 
imply  nothing  about  the  relative  merits  of  the  two  algorim  ■;  aj  . : 1 

graphs  with,  say,  200  vertices.) 

Finally,  I wish  to  thank  several  friends  for  ^heir  belt  with  my 
on  this  paper.  To  Colin  McDiarmid  ni  share  Gzemerddi  1 n indebted 
many  stimulating  conversations,  persi  Diaconis  told  me  abou' 
taper  [If].  David  Avis,  Don  Knuth,  Tvo  Rosenborg,  and  y rjan  r a 
various  versions  of  the  manuscript  and  made  many  helpful  suggestion, 
improve  the  presentation. 
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